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Method for the selection of biomolecules from biomolecule variant libraries 

The invention concerns a method for the selection of biomolecules from biomolecule variant 
libraries, in particular of enzymes or other biocatalytically active biomolecules. Biomolecules 
find manifold use in the technical or medicinal applications and processes. Many of the 
therefore needed properties of biomolecules are not present in nature or could not yet be 
identified. The generation of such new properties from existing biomolecules demands the 
production of very large variant libraries with stochastically changed compositions by the 
introduction of mutations. The identification of variants with the desired properties needs 
suitable selection- or screening-methods. 

The stochastically introduction of mutations into the genetic material is also the incitement of 
natural evolution. Natural systems replicate with mutation rates, which lay curtly under the so 
called error threshold. The error threshold is the maximal mutation rate, which just not leads 
to an extinction of the population. With mutation rates below the error threshold sufficient 
variations are accumulated in the library to allow the population a fast adaptation to altered 
conditions. Mutation rates above the error threshold after some generations bring forth, that 
no survivable and accordingly replicatable individuals are present anymore, und the 
population collapses (Eigen, M., McCaskill, J., Schuster, P.: The molecular quasispecies. 
Adv. Chem. Phys. 1989, 75, 149-263). 

New biomolecules can be produced by a linkage of the new property to the survival or a 
sufficiently large growth advantage of an organism. At this the variant library is transferred 
into a corresponding organism and the growth conditions are chosen in a way, that only the 
organisms survive or comparatively grow faster, which produce a variant of the biomolecule 
with the wanted new property (Zaccolo, M, Gherardi, E.: The effect of high-frequency 
random mutagenesis on in vitro protein evolution: a study on TEM-1 beta-lactamase. J. Mol. 
Biol. 1999. 285, 775-83. or Samuelson, J.C., Xu, S.Y.: Directed evolution of restriction 
endonuclease BstYI to achieve increased substrate specificity. J. Mol. Biol. 2002. 319,673- 
83). This application is only applicable to a narrowly limited circle of biomolecules, which 
provide an advantage to a chosen organism. Biomolecules, which catalyze arbitrary chemical 
reactions, cannot be selected in this way. Since the organism needs to remain alive during the 
whole selection process, toxic or otherwise for the growth disadvantageous properties cannot 
be selected. 
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Another method for the selection of new biomolecules is the linkage of the biomolecule to the 
coding nucleic acid sequence (Amstutz, P., Forrer, P., Zahnd, C, Pluckthun, A.: In vitro 
display technologies: novel developments and applications. Curr. Opin.Biotechnol. 2001. 12. 
400-5. Xia, G., Chen, L., Sera, T., Fa, M., Schultz, P.G., Romesberg, F.E.: Directed evolution 
of novel polymerase activities: mutation of a DNA polymerase into an efficient RNA 
polymerase. Proc. Natl. Acad. Sci. USA. 2002. 99. 6597-602. Pschorr, J.: Genotyp und 
Phanotyp koppelnde Verbindung. DE001 96463 72C1). An application of these technologies 
with living organisms like phages or bacteria limits the spectrum again to non-toxic or not 
growth inhibiting biomolecules. Also the substrates and products of the wanted reaction may 
not have any damaging effect to the presenting organism. Additionally catalytic activities can 
only be selected if biomolecule and substrate can be presented at the same organism. As the 
activity of the catalytic biomolecules cannot be limited to the organism, which presents them, 
and they therefore also take place reactions at other individuals of the library, this method 
often leads to false selection of biomolecules. 

In dissection methods (screening methods) every variant of a biomolecule library is analyzed 
separately regarding the wanted property (Joo, H., Lin, Z., Arnold, F.H.: Laboratory evolution 
of peroxide-mediated cytochrome P450 hydroxylation. Nature. 1999. 399. 670-3. Korbel, 
G.A., Lalic, G., Shair, M.D.: Reaction microarrays: a method for rapidly determining the 
enantiomeric excess of thousands of samples. J. Am. Chem. Soc. 2001. 123. 361-2). Even 
with very short measurement times (e.g. 100 msec per variant) this methods demands a high 
time expense (e.g. 22 days) for the analysis of large libraries (e.g. 10 7 ). The continuous 
measurement of variants in these dimensions needs the setup of appropriate complex 
apparatuses. Besides for every variant of the library a corresponding property test needs to be 
run, what leads to very high costs of these methods. 



To screen or to change enzymatic properties in the laboratory, the so-called "enzyme 
engineering", according to the state of the art within an enzyme library genotype (a nucleic 
acid, which can be amplified and comprises a variant of a gene) and phenotype (a functional 
feature, for example a catalytic property) need to be coupled together. This coupling for 
instance is realized through techniques like phage display or ribosome display or thereby, that 
30 each genotype is testing individually for its phenotype. 

The aim of the present invention is to give a method to identify biomolecules in variant 
libraries of biomolecules. 



According to the present invention the aim is solved by a method for the identification of 
biomolecules in variant libraries of biomolecules comprising the steps: 

a) Production of a variant library, consisting of a number of variants (B 0 ) of gene sequences 
coding for the biomolecule, 

b) Division of the variant library into a number of compartments (W 0 ), which is smaller than 
the number of variants in the variant library (B 0 ) preferentially by a factor of ten, more 
preferentially by a factor of 1 00, 

whereas each compartment contains a partial library which contains Ko=BoAV 0 variants, 

c) Production of biomolecules in the compartments and testing of the biomolecules obtained 
in the single compartments for a specified property (phenotype), preferentially a biocatalytic 
activity, whereas from the observed phenotype no direct conclusions on the genotype can be 
made, 

d) Selection of at least one compartment, which contains biomolecules fulfilling the wanted 
property, preferentially a biocatalytic activity, 

e) Division of the partial library contained in the selected library into further compartments 
corresponding to step b) and 

f) n-fold repetition of steps c) to e) until in every compartment maximally only one variant 
(K„<=1) of the gene sequence coding for the biomolecule is contained. 

This method is especially suitable for the generation of biomolecules with new catalytic 
activities, which either do not exist in nature or at least cannot be catalyzed by the starting 
biomolecule. Furthermore with this method existing catalytic activities can be adapted to 
exterior conditions like for example temperature or solvent, under which no or only little 
activity was present. 

As in the present invention the production of the biomolecules can lead to a die off of the 
organisms or can be carried out by cell-free systems, the method can be applied to all kind of 
biomolecules and is not limited to non-toxic or not growth inhibiting activities. As up to a 
million or more variants are analyzed with one test and simultaneously for the corresponding 
property, the time needed for the screening of the library and the costs needed for the property 
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tests are reduced by a corresponding factor. Variants, which possess the wanted properties, 
can by isolated from the original variant mixture in a secure and reproducible way. 

In the step a) of the method a variant library of gene sequences coding for the biomolecule is 
produced by standard molecular biology processes. 

According to the present invention among a variant library is conceived: A mixture of 
proteins or nucleic acids, which differ from each other at least in one position of their 
sequence. 



Preferentially the variant library consists of a number of variants in the dimension of B 0 = 10 3 
to Bo = 10 15 . For example within a partial area of the biomolecule randomly chosen sequence 
modules can be introduced, so that in case of a nucleic acid with 25 altered positions a library 
size of 4 25 = 1.1 x 10 15 or in case of a protein with 7 altered positions a library size of 20 7 = 
1.3 x 10 9 originates. 

More preferentially the dimension lies in the range between B 0 = 10 5 to B 0 = 10 9 . 

More preferentially the variant library consists of DNA-pIasmids or linear nucleic acid 
molecules, which contain the gene sequence coding for the biomolecule. 

According to the present invention biomolecules are proteins, nucleic acids or other 
biopolymers consisting of organic building blocks. Preferentially these are biomolecules, 
enzymes or ribozymes or other biomolecules, which as biocatalysts accelerate the conversion 
of chemical or biochemical substances. 

Standard molecular biology methods, with which such variant library can be produced, are for 
example defective amplification techniques for nucleic acids. For this purpose replicating 
enzymes, e.g. polymerases, which conduct the novel synthesis of a biomolecule with the help 
of a template, are used. The introduction of mistakes and the thereby generation of different 
variants is achieved by the naturally existing error rate of these replicating enzymes or can be 
increased by changing the reaction conditions (e.g. imbalance of the synthesis building 
blocks, addition of building block analogues, alteration of the buffer conditions). Besides the 
introduction of mistakes a variant library can be obtained by using the natural occurring 
diversity to originate a specific biomolecule or a class of biomolecules. 
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In comparison to conventional screening methods the process according to the present 
invention allows the screening of very large libraries. The division process according to the 
present invention allows the simultaneous testing of an arbitrary number of variants. 

The size of the library is only limited by the sensitivity of the assay, with which the 
5 biomolecules contained in the single compartments are tested for a specified property, 
preferentially a biocatalytic activity, in step c) of the process. 

Preferentially the libraries are produced by error-prone PCR or by the introduction of 
synthetically randomized sequence regions (Cadwell, R.C., Joyce, G.F.: Randomization of 
genes by PCR mutagenesis. PCR Methods Appl. 1992. 2. 28-33; Wells, J. A., Vasser, M., 
10 Powers, D.B.: Cassette mutagenesis: an efficient method for generation of multiple mutations 
at defined sites. Gene. 1985. 34. 315-23). 

In the process the mutation rate preferentially is chosen far beyond the error threshold. 
Thereby within the starting library preferentially more than 90%, more preferentially more 
than 99% and even more preferentially more than 99.9% of the generated variants are not 
1 5 survivable. 

The error threshold is defined as the maximal mutation rate, which in evolutionary methods 
(cyclic application of mutation and selection) just not leads to a melting of the genetic 
information and thereby retains the survivability of a population. A melting of the genetic 
information is defined as a process, in which by a repeated appliance of a too high mutation 
20 rate in the replication of a nucleic acid so many mutations accumulate, that the nucleic acid 
does not contain any physiologically meaningful information anymore. 

The survivability of a gene and accordingly a gene product is thereby defined in the way, that 
the gene and accordingly its gene product still is able to perform its physiological activity like 
for example the binding of a partner or the catalytic cleavage of a substrate. 

25 An important advantage of the present invention in comparison to conventional methods, 
which contain mutagenesis and selection steps, consists therein, that in the process according 
to the present invention one starts from a large library, which a priori contains the wanted 
variant. That means that after the screening one does not obtain a suboptimal variant, which 
needs to be further improved through additional cycles of mutation and recombination. 
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The method according to the present invention is characterized thereby that in the beginning 
one-time in step a) a variant library is generated, which subsequently is screened for variants 
with the wanted property. From step b) on no additional mutation or recombination steps take 
place. That means that in between or during the individual singling steps (steps b. to f.) the 
isolated partial libraries do not undergo a further mutagenesis or recombination. That means 
that the variants which are isolated at the end of the process with the wanted properties are 
already present in the initially (in step a.) applied library. 

Preferentially the process according to the present invention is conducted in a way that in step 
d) in all passages only one compartment is chosen namely that one in which the wanted 
property (phenotype) is strongest distinct, preferentially the compartment with the strongest 
catalytic activity. Thereby with the process according to the present invention the best variant 
can be isolated, in which the wanted property (phenotype) is strongest distinct, without the 
obligatory necessity of selecting suboptimal variants or groups of variant. 

At the production of the variant library one preferentially starts from an already known 
nucleic acid or protein sequence, consecutively called starting sequence. Based on this 
starting sequence the variant library is produced by the above mentioned methods (e.g. error- 
prone PCR or by the introduction of synthetically randomized sequence regions). 

The method according to the present invention is characterized thereby that the starting 
sequence does not need to be contained in the variant library. 

20 The starting sequence often codes for a phenotype which is to a certain degree similar to the 
wanted property. So one would, for example when one wants to obtain an RNase as the 
wanted phenotype, which cleaves after an adenosine, chose for instance an RNase as the 
starting sequence, which cleaves after a guanosine (and not a protease or so). 

However the more similar the starting sequence is to the phenotype of the wanted property the 
larger however is usually the background activity within the test in step c) of the process. 
Advantageously this background is avoided, when the starting sequence is not present in the 
variant library anymore. 

Preferentially the variant library is produced in a way that the starting variant is not contained 
in the variant library anymore. This for example can be achieved thereby that a stop codon is 
introduced into the starting sequence, which is removed again by the introduction of mutated 
regions into the starting sequence. Thereby it can be assured that eventually protracted 
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starting sequences are because of the stop codon physiologically not active and that on the 
other side physiologically active variants need to contain mutated regions. 

In opposite to the in the state of the art applied high-throughput processes the method 
according to the present invention allows the screening of about multiples larger libraries in a 
5 fraction of the time. In comparison to in vivo selection methods the method according to the 
present invention is also not limited to specified enzyme classes and specified enzyme 
properties respectively. 

In step b) the variant library is divided up into a number of compartments W 0 , which is 
smaller than the number of variants contained in the variant library at least by a factor of 10, 
1 0 preferentially by a factor of 1 00. 

At this before the division the variant library can be transformed into an organism or the 
division can be conducted on the level of the coding sequences. The division is done in a way 
that each variant of the library occurs at least once, preferentially exactly once. 

The then in step c) conducted production (expression) of the biomolecules is done 
preferentially by the organism or by in vitro expression systems (e.g. cell extracts). 

As expression organisms which are used regularly in molecular biology for the expression of 
biomolecules, like proteins, can be used, the expression organism is chosen depending on the 
biomolecule which needs to be expressed. Preferred expression organisms are bacterial cells 
(e.g. E. coli, B. subtilis) or eukaryotic cells (e.g. S. cerevisiae, insect cells, tumor cells). 
By the transformation of the variant library into the expression organism single clones 
originate. Thereby every clone contains one defined genotype respectively that is one variant 
of the gene sequence coding for the biomolecule. According to the present invention one 
clone can also be defined as a sole coding sequence that is a defined genotype without 
expression organism. 
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The transformation into an organism is done with known molecular biology methods for the 
transformation of gene sequences into expression organisms and depends on the expression 
organism used. A preferred method is electroporation. 

Preferentially the division into compartments is done immediately after the transformation of 
the variant library into the expression organism. 
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The number of the compartments W 0 amounts to preferentially between 10 1 and 10 4 
compartments and more preferentially to between 96 und 1536 compartments. 

The library size B 0 divided by the number of compartments W 0 gives the clone number per 
compartment Ko = B 0 / W 0 . 

Every compartment contains a partial library with the number of Ko variants of the gene 
sequence coding for the biomolecule. 

The division particularly preferentially is done into compartments of a microtiter plate and a 
deep well plate respectively. 

Preferentially in step c) an amplification of the partial libraries in the compartments is carried 
out by a growth of the organisms or by an amplification of the coding sequences by template- 
depending enzymes up to a number of individuals V 0 per compartment and the production of 
the catalytic biomolecules is carried out by the expression organisms or by cell-free 
expression systems like for example E. coli lysates, reticulocyte lysates, C. lucknowese lysates 
or insect cell lysates. 

Preferentially a conservation of a part of the partial library on the level of organisms or on the 
level of the pure coding sequences at the point in time x under retention of the compartment 
allocation is carried out. 

The conservation is carried out preferentially by the production of a 1:1 mixture of the 
organism culture and glycerol and storing of that mixture under growth inhibition at -80°C. A 
conservation on the level of the coding sequences is carried out by taking ofF a part of the 
amplified sequences and storage, preferentially at -20°C. 

A determination of the number of individuals V 0 (x) of the conserved partial library on the 
level of organisms is preferentially carried out by measuring the optical density OD of a 
liquid organism culture and correlation with the number of individuals or by transferring an 
aliquot of this culture to a solid medium and counting the thereof resulting colonies. The 
determination of the number of individuals V 0 (x) of the conserved partial library on the level 
of the coding sequences is carried out preferentially by determining the concentration with 
spectroscopic methods. 

The number of individuals V 0 (x) divided by the number of clones per compartment Ko gives 
the amplification factor F 0 (x) per clone, F 0 (x) = V 0 (x) / Ko. 
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In step c) of the process the biomolecules contained in the single compartments are tested for 
a specified property (phenotype), preferentially for a biocatalytic activity. 

In step c) an amplification of the partial library in the compartments is preferentially carried 
out up to a number of individuals V 0 (x) at the point in time x per compartment, whereas the 
number of individuals divided by the number of clones per compartment K<> gives the 
amplification factor F 0 (x) per clone. 

Before, during or after the growth of the organisms or the amplification of genotypes the 
production of the biomolecules is carried out thereby in the single compartments. 

Preferentially the test is carried out for a biocatalytic activity by incubating the catalytically 
active biomolecules contained in the compartments or isolated from them with corresponding 
substrates and allocating activity values to the corresponding compartments. Compartments, 
in which the activity value exceeds a defined barrier, are assessed as positive. 
As each compartment contains more than one clone of the variant library, no conclusion can 
be made from the observed phenotype to the genotype, because the observed phenotype 
results from the sum of clones contained in the compartment. 

Although therefore in the method according to the present invention genotype and phenotype 
are decoupled, the clone responsible for the wanted property, which for instance comprises 
the wanted enzymatic activity, can be retrieved and isolated from the mixture of clones with 
the method according to the present invention. That it is possible to retrieve the clone 
responsible for the wanted property from the mixture of clones with a screening method, in 
which genotype and phenotype are decoupled, is surprising to persons skilled in the art, as all 
known screening methods base on the coupling of genotype and phenotype. 

To retrieve the clone with the wanted property is achieved with the steps d) and e) of the 
method according to the present invention. 

In step d) of the process at least one compartment is chosen, which contains biomolecules, 
which fulfill the wanted properties. 

Preferentially therefore the partial library or the corresponding conserved partial library is 
diluted by the means of factor F 0 (x), so that in a given volume each clone contained in the 
compartment statistically occurs up to a number of Xo < W,. This volume in turn is divided up 
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into a number W, of new compartments without a prior amplification. The new number of 
clones per compartment is K, = Xo * Ko / Wi . 

Now the steps c) to e) of the process are repeated as often as the number of clones per 
compartment K n < 1. As soon as K„ < 1, the wanted phenotype can be allocated to a discrete 
genotype. 

In order to avoid the loss of single clones and thus of variants of the library of biomolecules, 
the step e) preferentially is conducted in a way that in the first passages of steps e) 1< X n ., < 
Wi applies, preferentially X„.j = 3 to 5. 

Step e) preferentially is repeated as often as the clone causing the wanted property is to be 
found in the new compartmented partial library. At this in the last passage of step e) X n 
preferentially is < 1 . Therefore the partial library preferentially is diluted in the last passage of 
step e) in a way that maximally one clone can be found per compartment and that in many 
compartments no clone is contained. Therewith an average number of X n < 1 results. 

In step 0 the steps c) to e) are repeated n-fold until in each compartment maximally only one 
variant (K„ <= 1) of the gene sequence coding for the biomolecule is contained. 

Die number of necessary repetitions n is depending on the number of variants (B 0 ) of the in 
step a) constituted variant library, the number of compartments (W n ) in which the library is 
divided up in step b) and e) und the number X n , with which a once retrieved clone will again 
be present in the next cycle. The number of conducted repetitions n thereby amounts to with a 
preferentially constant X n = 1 and constant W n : 

n = log 10 (B 0 ) - log 10 (W n ) oder n = (log,o(B 0 ) - log 10 (W n )) + 1 , 
whereas n eventually is rounded up to the next larger whole number. 

If in step a) for example a library with B 0 = 10 6 variants is constituted und if the partial 
libraries in step b) and e) are divided up with X„ = 1 in W„ = 96 or W n = 100 compartments 
respectively, than n = 4 to 5 passages of the steps c) to e) are necessary in order to retrieve the 
clone with the wanted property. 
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With the consecutive execution examples the invention is illustrated in detail: 

Execution example 1 describes exemplarily the selection of active RNase Tl from a variant 
library of inactive variants of RNase Tl . 

Execution example 2 describes exemplarily the selection of an adenosine cleaving RNase Tl 
from a library of RNase Tl variants. 



Execution example 1 

1. Cloning the genes o f RNase Tl wildtvpe and His92Ala 

With the two primers A2Vo_BspHI (SEQ_ID No. 1) and A2Hi_PstI (SEQJD No. 2) (both 
from IBA Goettingen, Germany) the genes coding for RNase Tl wildtype (SEQJD No. 3) 
and for RNase Tl variant His92Ala (SEQJD No. 4) including the signal peptide for a 
periplasmatic expression were amplified from the corresponding source vectors pA2Tl 
(SEQJD No. 5) und pA2Tl_H92A (SEQJD No. 5, in which SEQJD No. 3 is replaced by 
SEQJD No. 4) by a PCR under the following conditions: 

1.1 PCR: 



PCR-reaction: 



1 0 pi 1 Ox VENT-buffer (NEB, Beverly, USA) 

2 pi dNTPs (each 1 0 mmol/liter) 

1 00 pmol Primer A2 Vo_BspHI (SEQJD No. 1 ) 

1 00 pmol Primer A2Hi_PstI (SEQJD No. 2) 

1 pl original vector (20 ng) (SEQJD No. 5) 

2 U VENT-Polymerase (NEB) 
adlOOpl H 2 Odest. 



PCR temperature profile: 



2 min / 94 °C 

1 . 45 sec / 94 °C (denaturation) 

2. 45 sec / 57 °C (annealing) 

3. 30 sec / 72 °C (elongation) 
2 min / 72 °C 



25 x 
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The resulting PCR-products were purified with the QIAquick PCR-purification-kit (Qiagen, 
Hilden, Germany) following the manufacturers instructions. 

1 .2 Restriction digest: 

In order to clone the genes into the expression vector pETBlue-2 (SEQ_ID No. 6) the PCR- 
products and the vector were incubated with restriction endonucleases BspHI and PstI and 
Ncol and PstI (all from MBI Fermentas, Vilnius, Lithuania) respectively as follows: 

Restriction digest reactions: 



PCR-Products: 


Vector: 




2 ug PCR-product 


4ug 


pETBlue-2 


2 ul lOx buffer 0 + (MBI) 


2 ul 


lOx buffer Y + (MBI) 


10 U BspHI 


10U 


Ncol 


10 U PstI 


10U 


PstI 


ad 20 ul H 2 Q dest. 


ad 20 ul 


H 2 Q dest. 



The restriction digest reactions were incubated for 2 h at 37 °C. To the „vector-reaction" 
subsequently for the dephosphorylation 1 U SAP (MBI Fermentas, Vilnius, Lithuania) is 
added and incubated for additional 30 min at 37 °C. Afterwards the enzymes get inactivated 
for 20 min at 80 °C. Hereupon the products are purified with the QIAquick PCR-purification- 
kit (Qiagen, Hilden, Germany). 



1.3 Ligation, transfor mation into E. coli and plasmid-orep aration 

The vector-DNA and the PCR-product are ligated by the incubation with T4-DNA-ligase 
follows: 



as 



Ligase-reaction: 200 frnol Vector-DNA 

600 frnol PCR-Product 

3ul 10xLigase-buffer(MBI) 

1 ul T4-DNA-ligase 

ad 30 ul H 2 0 dest. 

The reactions are incubated for 8 h at 16 °C and the enzyme is subsequently inactivated by a 
10 minute incubation at 65°C. 1 ul of this reaction was directly used for the transformation of 
commercially available competent ElectroTen-cells (Stratagene, La Jolla, USA) with 
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electroporation. The electroporated cells were plated on agar plates with ampicillin and 
cultivated over night at 37°C. Starting from a resulting single colony the ready plasmid was 
re-isolated with the plasmid-purification kit QIAprep Minipreparation-kit (Qiagen, Hilden, 
Germany) following the manufacturers instructions. 

1 .4 Production of a plasmid mixture as RNase Tl-test jibraryj 

As result from the preceding steps the two plasmids pETBlue-RNaseTl-wildtype and 
pETBlue-RNaseTl -His92Ala are obtained. 

In order to produce the test library the plasmid are mixed as follows: 

1 pg PETBlue-RNaseTl-wildtype is mixed with 1 ug pETBlue-RNaseT 1 -His92 Ala. Thereby 
one obtains a relation of 1 : 1,000,000 RNase Tl wildtype (active) to the variant His92Ala 
(inactive). 

1.5 Production of the expression strain- 

For the expression of the RNase Tl-test library an E. coli strain is needed, in which the 
RNase I is knocked out. Corresponding strains like for example AT9 (mal9 X gdhA2 relAl 
spoTl metBl) are available via the E. coli Genetic Stock Center New Haven, USA. The 
expression vector pETBlue-2 used in the example additionally needs the T7-RNA-polymerase 
for the expression, which is not present in E. coli. With the commercially available A.DE3- 
Lysogenisation-kit (Novagen, Madison, USA) the T7-RNA- P olymerase coding gene is 

introduced into the strain AT9. Through this an E. coli-strain is obtained, which is 

characterized by the absence of RNase I and the presence of the T7-RNA-polymerase (DE3). 

Electrocompetent cells were prepared from this strain with standard molecular biology 

methods and stored at -80°C. 

1.6 Transformati on o f t h e ex pression strai n with the test Hhrary- 

Into the strain produced as precedent described one ng of the plasmid mixture as a test library 
was transformed via electroporation and the resulting cells were taken up into 10 ml liquid 
medium (LB-medium: 10 g Tryptone, 5 g yeast extract (all from Becton Dickinson, 
Hcdelberg, Germany), 10 g NaCl (from Sigma, Deisenhofen, Germany)) containing 
ampicillin after 1 hour incubation at 37°C. 
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The in this way obtained preparatory culture is immediately divided on a 96-well microliter 
plate (MTP) (100 ul per well) and incubated at 30°C and 800 rpm over night. 

By the transformations with electroporation approximately 3 million transformed clones are 
obtained. 

1.7 Growth of the main culture and expression of RNase T1 

A 96-well deep well plate (DWP) is filled with 1.5 ml liquid medium with ampicillin per well 
respectively. The medium is inoculated with 50 pi from the preparatory culture respectively 
and the DWP is cultured at 37°C and 800 rpm. When an optical density OD^ of the cultures 
of ODsoo = 1.0 is reached the cultures are induced with 1 mmol/liter IPTG. Afterwards the 
plate is incubated for additional 4 h at 37°C and 800 rpm. 

1.8 Preparation of protein samples 

By the signal peptide ompA the expressed RNase Tl -molecules are directed into the 
periplasmatic space of the expression bacterium. Through an osmotic shock the protein can be 
prepared very easily. The purification procedure comprises the following steps: 

© Collection of the cells by centrifugation at 4000 rpm, 4°C for 5 min, 
© Decantation of the medium supernatant, 

© Resuspension of the bacterial pellet in 25 pi buffer A (50 mmol/liter Tris/HCl, pH 7.5, 

10 mmol/liter EDTA, 15 % Saccharose w/v) respectively, 
© Incubation on ice for 30 min, 

© Addition of 125 pi buffer B (50 mmol/liter Tris/HCl, pH 7.5, 10 mmol/liter EDTA) 
respectively, 

© Centrifugation at 4000 rpm, 4 °C, for 20 min, 

© Removal of the supernatant and transfer into a MTP (periplasm), 

© Storage of the bacterial pellet. 

1.9 Production of the substrate for RNase Tl 

As a substrate (Sub_G) a double stranded DNA-molecule with a central single stranded area 
was used, which contained a guanosine-RNA-Building block as point of attack for the 
enzyme. The ends of this substrate are labeled with differing dyes for the red (Cy5 at the 5'- 
end) and the green (RhG at the 3'-end) spectral range. In order to avoid a bleaching of the 
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labeled substrate the corresponding solutions and incubation reactions are protected from 
light. The buffers and reactions were produced with DEPC-treated water. The substrate is 
composed of the following three oligonucleotides (IBA Goettingen, Germany): 

1. Sub_G: 

5'-Cy5-CCATACCAGCCAGCCACAArGCAAGCCACCGAAGCACAGATA-RhG-3' 

(SEQJDNo. 10) 

2. Tl_Sub_Li: 

5 '-GTGGCTGGCTGGTATGGA-3 ' (SEQ_ID No. 7) 

3. Tl_Sub_Re: 

5 ' -TATCTGTGCTTCGGTGGC-3 ' (SEQ_ID No. 8) 

By the consecutively described hybridisation the three components are annealed to a double 
stranded substrate: 

Hybridisation reaction: Hybridisation program: 

1 000 pmol Sub_G 1 . l o sec 94°C; 

1 200 pmol Tl_Sub_Li 2 . Cooling to 25 °C with 0. 1 °C/sec 

1 200 pmol T 1 Sub Re 3 . 4 °C 
20 pi MES (1 mol/liter, pH 6.0) 
ad 1000plDEPC-H 2 O 

1.10 Incubation of the protein samples with the substrate 

In a MTP 10 pi of the double stranded substrate are provided per well respectively. Thereto 
10 pi of the protein samples isolated from the periplasm are added respectively, the MTP is 
sealed air-proof and incubated for 24 h at 37°C in the dark. Afterwards 5 pi of the reactions 
are transferred into a MTP with glass bottom respectively and mixed with 250 pi buffer C 
respectively (100 mmol/liter MES, pH 6.0, 100 mmol/liter NaCl, 2 mmol/liter EDTA). 

1.11 Acti vity determination 

In order to determine the enzyme activity the plate with the glass bottom, into which the 
incubation reactions were transferred as described in 1.10, was measured on the fluorescence 
correlation spectroscope ConfoCor 2 (Evotec Biosystems, Hamburg, Germany and Carl Zeiss 
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Microscopy, Jena, Germany). The evaluation of the date was conducted using the ConfoCor 
2-software (version 2.5). 

For the measurements an Argon-laser (1 = 488 nm) is used for the excitation of RhG in 
combination with a helium/neon-laser (1 = 633 nm) for Cy5. The FCS measurement volume in 
the cavities was adjusted 200 urn above the glass surface. The measurements were conducted 
for 20 sec per well. 

By a cross correlation analysis of the obtained data one can conclude on an eventual cleavage 
of the substrate. A cleavage of the substrate by RNase Tl leads to a decoupling of both 
fluorescent dyes and therefore to a loss of the cross correlation signal. Uncut substrate 
molecules in contrast carry both dyes and deliver a strong signal. 

By the division of the 3 million clones obtained by transformation und through the mixture 
relation between active RNase Tl wildtype and inactive RNase Tl His92Ala of 1 : 1,000,000 
theoretically three wells with activity should be detectable with measurements. Statistical 
deviations between 1 to 5 wells with activity are however possible. 

Figure 1 shows the thus obtained data for an RNase Tl-test library produced as described in 
point 1 to 1.11 consisting of 3 million clones on one plate with a mixture relation of RNase 
Tl-wildtype to RNase Tl-His92Ala of 1 : 1,000,000. The RNase Tl-activity was detected as 
described above via cross correlation analysis. For a better overview a reciprocal view was 
chosen, that means that high peaks mean a low signal and low peaks a high signal. Fig. 1 
shows 2 clear peaks, which are caused by a loss of the cross correlation signal. These two 
peaks indicate that in the experiment an RNase Tl-activity in two of 96 wells securely was 
present. 

2. Re-isolation of the partial library 

In the plate obtained in section 1 a plasmid preparation is conducted with the stored bacterial 
pellets from the protein preparation using the QIAprep Minipreparation-kit (Qiagen, Hilden, 
Germany) with one of the wells, which showed an RNase Tl-activity in the activity 
measurement (section 1.11), 

By the original division of 3 million clones on the plate a number of 3,000,000 / 96 = 31,250 
different clones per well resulted. Therefore a mixture relation from RNase Tl wildtype to 
RNase Tl His92Ala of 1 : 32,250 consists in the isolated partial library. 
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2.1 Additional separating^ 

Through a transformation of different aliquots of the thus obtained partial library in analogy 
to section 1.6 the amount of plasmid DNA was determined, which is necessary to now obtain 
about 100,000 transformed clones via electroporation. 

Afterwards the determined amount of the partial library is transformed into the expression 
strain and the same process as for the test library is conducted. As about 100,000 clones were 
divided up and the new mixture relation was 1 : 32,250, again theoretically three wells with 
detectable activity were expectable. 

The plasmids were again re-isolated from the bacterial pellets from one of the wells with 
activity. The mixture relation in this again enriched partial library was now 100,000 / 96 = 
1,050. 

An additional repetition of the depicted scheme with a division of now about 3,000 clones 
gave a once again enriched partial library with a mixture relation of 3,000 / 96 = 3 1 . 

As from this last partial library 96 clones were subdivided on a MTP, three wells resulted 
with activity. As these activities now resulted from an individual clone respectively, the 
activity of RNase Tl wildtype could be directly allocated to this clone. 



Execution example 2 



Wildtype RNase Tl cleaves RNA in a highly specific way after guanosine residues. The aim 
of this execution example is to obtain RNase Tl variants which can cleave RNA at adenosine 
residues. Therefore an RNase Ta library was produced and screened for corresponding 
variants. 



1. Design of the library 



The region of the guanosine binding loop 1 which needed to be mutagenized comprises the 
amino acids 41 to 57 of RNase Tl wildtype (SEQJD No. 3). The loop 1 -DNA-sequence is 
mutated by a corresponding synthesized mutagenesis-oligodesoxynucleotide Loopl_32 in a 
way that 3 to 4 of the 17 amino acids respectively are randomly replaced by others. Therefore 
the following sequence is synthesized: 
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5' -GTAGGATCCAATTCTTACCCACAC aay tax aax aax tax gay ggz ttz gaz ttx 
tcz gty agx tcz ccx tax tax GAATGGCCTATCCTCTCGAGCGG-3 ' 

in which „n« (A, G, C or T -„any") and „b» (G, C or T - not A) from SEQJD No. 9 are 
precisely defined as follows: 



a = 86%A6%C 4%G 

c = 86%C6%A 4%G * « C - 88 % C 6%G 6%T 

9=79%G8%A 8%C y = 9'= 82%G 11%C 7%T 

t = 79%T8%A 8%C z=t'=82%Tll%C 7%G 

With A = Adenine, C = Cytosine, G = Guanine, T = Thymine. 

The oligonucleotide Loopl_32 (IBA, Goettingen, Germany) is afterwards directly used as a 
primer (in section 3.1) in a PCR. 

2. Production of the vector for the screening 

The gene of RNase Tl wildtype (SEQJD No. 3) including the signal peptide for a 
periplasmatic expression is cloned into the vector pETBlue-2 (Seq_ID No. 6) as described in 
the execution example 1 (section 1.1. - 1.3.) and the vector pETBlue-RNase Tl -wildtype is 
obtained. 

Afterwards the vector pETBlue-RNase Tl-wildtype is digested with PvuH und Sspl (both 
from MBI Fermentas, Vilnius, Lithuania): 

Reaction: 

4 ug pETBlue-2 
2 pi lOx buffer G (MBI) 
10 U Sspl 
10 U PvuH 
ad 20 pi H 2 0 dest. 

The restriction digest reaction is incubated for 2 h at 37°C. Afterwards the enzymes are 
inactivated for 20 min at 80°C. The products are separated on a 0.8% agarose gel and the 
product band at 2498 bp is cut out from the gel. The DNA is consecutively re-isolated via the 
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QIAquick gel-extraction-kit (Qiagen, Hilden, Germany). 200 fmol of the isolated fragment 
are recircularized in a ligation: 

Reaction: 200 fmol fragment 

2 pi 1 Ox Ligase-buffer (MBI) 

2 nl 50 % PEG (MBI) 

1 pi T4-DNA-Ligase 

ad 20 pi H 2 0 dest. 

The reactions are incubated for 8 h at 16 °C and the enzyme is subesequently inactivated by a 
10 minute incubation at 65°C. 1 pi of this reaction was directly used for the transformation of 
commercially available competent ElectroTen-cells (Stratagene, La Jolla, USA) with 
electroporation. The electroporated cells were plated on agar plates with ampicillin and 
cultivated over night at 37°C. Starting from a resulting single colony the ready plasmid was 
re-isolated with the plasmid-purification kit QIAprep Minipreparation-kit (Qiagen, Hilden, 
Germany) following the manufacturers instructions. The thereby obtained plasmid is named 
pETMini_RNaseTl_wildtype. 

3. Cloning of the library RNaseTI-Lonp l 

With the both primers Loopl_32 (SEQID No. 9) and A2Hi_PstI (SEQJD No. 2) (both from 
IBA Goettingen, Germany) a part of the RNase Tl Wildtyp (SEQJD No. 3) is amplified 
from the original vector pA2Tl (SEQ ID No. 5) through a PCR under the following 
conditions: 

3.1 PCR: 



PCR-reaction: 10 pi 

2 pi 

lOOpmol 
100 pmol 

1 Ml 

2U 

ad 100 pi 

Temperature profile of the PCR: 

1. 

2. 



lOx Taq-buffer (MBI Fermentas, Vilnius, Lithuania) 
dNTPs (each 10 mmol/liter) 



primer Loop 1 32 
primer A2Hi_PstI 
original vector (20 ng) 
Taq-polymerase (MBI) 
H 2 0 dest. 

2 min / 94 °C 

45 sec / 94 °C (denaturation) 
45 sec / 57 °C (annealing) 



(refer to section 1) 
(SEQJD No. 2) 
(SEQJD No. 5) 



30 x 
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3. 30 sec / 72 °C (elongation) 
2 min / 72 °C 

The resulting PCR-products were purified with the QIAquick PCR-purification-kit (Qiagen, 
Hilden, Germany) following the manufacturers instructions. 

3.2 Restriction dig est: 

To clone the library into the expression vector pETMini_RNaseTl_wildtype the PCR product 
and the vector are incubated using the restriction endonucleases BamHI and PstI (both from 
MBI Fermentas, Vilnius, Lithuania) as follows: 

Restriction digest reactions: 



PCR-Product: 




Vector: 




2ug 


PCR-product 


4ug 


pETMiniRNaseT 1 wildtype 


2 pl 


lOx buffer G + (MBI) 


2 pi 


lOx buffer G + (MBI) 


10U 


BamHI 


10U 


BamHI 


10 U 


PstI 


10U 


PstI 


ad 20 pi 


H 2 Q dest. 


ad 20 pi 


H 2 Q dest. 



The restriction digest reactions are incubated for 2 h at 37 °C. To the „vector-reaction" 
subsequently for the dephosphorylation 1 U SAP (MBI Fermentas, Vilnius, Lithuania) is 
added and incubated for additional 30 min at 37 °C. Afterwards the enzymes get inactivated 
for 20 min at 80 °C. The products are separated on a 0.8% agarose gel and for the vector 
reaction the product band at 2608 bp and for the PCR-reaction the product band at 259 bp is 
cut out from the gel. The DNA is consecutively re-isolated from the gel pieces via the 
QIAquick gel-extraction-kit (Qiagen, Hilden, Germany). 

3.3 Ligation, transform ation into F. coli and p l asmid-re-isolation 

The vector DNA and the PCR product are connected with T4-DN A-Li gase as follows: 

Ligase-reaction: 200 frnol Vector-DNA 

600 frnol PCR-product 

3 pi 1 Ox Ligase-buffer (MBI) 

1 pl T4-DNA-Ligase 

ad 30 pl H 2 Q dest. 
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The reactions are incubated for 8 h at 16°C and subsequently the enzyme was inactivated by a 
10 minute incubation at 65°C. The enzymes are removed from the solution by shaking out 
with phenol/chloroform twice and the obtained aqueous solution is precipitated by adding the 
2.5-fold volume of ethanol and incubation for 1 h at -20°C. The reaction subsequently is 
centrifuged for 15 minutes with 15,000 rpm at 4°C and the pellet is washed with 70% ethanol. 
After an additional 15 minute centrifugation at 13,000 rpm at 4°C the ethanol is taken off and 
the DNA-pellet is dried. Afterwards the DNA is resolved in 3 ul H 2 0 dest. and directly used 
for the transformation of commercially available competent ElectroTen-cells (Stratagene, La 
Jolla, USA) via electroporation. From the electroporated cells 10 ul are plated on agar plates 
with ampicillin and incubated at 37°C. The rest of the electroporated cells is directly diluted 
into 100 ml liquid medium (LB-medium: 10 g Tryptone, 5 g yeast extract (both from Becton 
Dickinson, Heidelberg, Germany), 10 g NaCl (Sigma, Deisenhofen, Germany)) containing 
ampicillin and also incubated over night. The colonies on the agar plate are counted and from 
the value the total size of the whole library is determined. Starting from 5 ml of the liquid 
culture, in which the clone mixture has grown, the ready plasmid library is isolated with the 
plasmid purification kit QIAprep Mini-preparation-kit (Qiagen, Hilden, Germany) following 
the manufacturer's instructions. As the result one obtains a library of up to 10 7 different 
RNaseTl Loopl-variants: pETMini_RNaseTl_Ll . 

3.4 Production of the expression strain: 

For the expression of the RNase Tl-test library an E. coli strain is needed, in which the 
RNase I is knocked out. Corresponding strains like for example AT9 (ma l9 X gdhA2 relAl 
spoTl metBl) are available via the E. coli Genetic Stock Center New Haven, USA. The 
expression vector pETBlue-2 used in the example additionally needs the T7-RNA-polymerase 
for the expression, which is not present in E. coli. With the commercially available JUDE3- 
Lysogenisation-kit (Novagen, Madison, USA) the T7-RNA-polymerase coding gene is 
introduced into the strain AT9. Through this an E. coli-strain is obtained, which is 
characterized by the absence of RNase I and the presence of the T7-RNA-polymerase (DE3). 
Electrocompetent cells were prepared from this strain with standard molecular biology 
methods and stored at -80°C. 
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3.5 Transformation of the expression strain with the library: 

Into the strain produced as precedent described 1 ng of the library pETMiniRNaseT 1_L 1 
was transformed via electroporation and the resulting cells were taken up into 200 ml liquid 
medium (LB-medium: 10 g Tryptone, 5 g yeast extract (all from Becton Dickinson, 
Heidelberg, Germany), 10 g NaCl (from Sigma, Deisenhofen, Germany)) containing 
ampicillin after 1 hour incubation at 37°C. 

10 ml of the thus obtained preparatory culture are immediately divided onto a 96 well 
microtiterplate (MTP) (100 ul per well) and incubated at 30°C and 800 rpm overnight. 

Thereby about 1 50,000 clones are obtained on the MTP. 

3.6 Growth of the main culture and expression of RNase Tl 

A 96-well deep well plate (DWP) is filled with 1.5 ml liquid medium with ampicillin per well 
respectively. The medium is inoculated with 50 ul from the preparatory culture respectively 
and the DWP is cultured at 37°C and 800 rpm. When an optical density OD 600 of the cultures 
of ODeoo = 1.0 is reached the cultures are induced with 1 mmol/liter IPTG. Afterwards the 
plate is incubated for additional 4 h at 37°C and 800 rpm. 

3.7 Preparation of protein samp les 

By the signal peptide ompA the expressed RNase Tl -molecules are directed into the 
periplasmatic space of the expression bacterium. Through an osmotic shock the protein can be 
prepared very easily. The purification procedure comprises the following steps: 

o Collection of the cells by centrifugation at 4000 rpm, 4°C for 5 min, 
o Decantation of the medium supernatant, 

° Resuspension of the bacterial pellet in 25 ul buffer A (50 mmol/liter Tris/HCl, pH 7.5, 

10 mmol/liter EDTA, 15 % Saccharose w/v) respectively, 
o Incubation on ice for 30 min, 

o Addition of 125 ul buffer B (50 mmol/liter Tris/HCl, pH 7.5, 10 mmol/liter EDTA) 
respectively, 

o Centrifugation at 4000 rpm, 4 °C, for 20 min, 

o Removal of the supernatant and transfer into a MTP (Periplasm), 

° Storage of the bacterial pellet. 
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3.8 Production of the substrate for RNase T1 

As a substrate (Sub_A) a double stranded DNA-molecule with a central single stranded area 
was used, which now contains an adenosine-RNA-Building block as point of attack for the 
enzyme. The ends of this substrate are labeled with differing dyes for the red (Cy5 at the 5*- 
end) and the green (RhG at the 3'-end) spectral range. In order to avoid a bleaching of the 
labeled substrate the corresponding solutions and incubation reactions are protected from 
light. The buffers and reactions were produced with DEPC-treated water. The substrate is 
composed of the following three oligonucleotides (IB A Goettingen, Germany): 

l.Sub_A: 

5'-Cy5-CCATACCAGCCAGCCACAArACAAGCCACCGAAGCACAGATA-RhG-3' 

(SEQJD No. 11) 

2. TISubJLi: 

5 '-GTGGCTGGCTGGTATGGA-3 ' (SEQJD No. 7) 

3. Tl_Sub_Re: 

5 '-TATCTGTGCTTCGGTGGC-3 ' (SEQJD No. 8) 

By the consecutively described hybridisation the three components are annealed to a double 
stranded substrate: 

Hybridisation reaction: Hybridisation program: 

1 000 pmol Sub_A 1 . i o sec 94°C; 

1200 pmol Tl_Sub_Li 2 . Cooling to 25 °C with 0,1 °C/sec 

1200 pmol Tl_Sub_Re 3. 4 °C 

20 ul MES (1 mol/liter, pH 6.0) 
ad 1000ulDEPC-H 2 O 

3.9 Incubation of the pr otein samp le s with the substrate 

In a MTP 10 ul of the double stranded substrate are provided per well respectively. Thereto 
10 ul of the protein samples isolated from the periplasm are added respectively, the MTP is 
sealed air-proof and incubated for 24 h at 37°C in the dark. Afterwards 5 ul of the reactions 
are transferred into a MTP with glass bottom respectively and mixed with 250 ul buffer C 
respectively (100 mmol/liter MES, pH 6.0, 100 mmol/liter NaCl, 2 mmol/liter EDTA). 
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3.10 Activity determination 

In order to determine the enzyme activity the plate with the glass bottom, into which the 
incubation reactions were transferred as described in 1.10, was measured on the fluorescence 
correlation spectroscope ConfoCor 2 (Evotec Biosystems, Hamburg, Germany and Carl Zeiss 
Microscopy, Jena, Germany). The evaluation of the date was conducted using the ConfoCor 
2-software (version 2.5). 

For the measurements an Argon-laser (1 = 488 nm) is used for the excitation of RhG in 
combination with a helium/neon-laser (1 = 633 nm) for Cy5. The FCS measurement volume in 
the cavities was adjusted 200 urn above the glass surface. The measurements were conducted 
for 20 sec per well. 

By a cross correlation analysis of the obtained data one can conclude on an eventual cleavage 
of the substrate. A cleavage of the substrate by RNase Tl leads to a decoupling of both 
fluorescent dyes and therefore to a loss of the cross correlation signal. Uncut substrate 
molecules in contrast carry both dyes and deliver a strong signal. 

Fig. 2 shows the thus obtained measurement data for a RNaseTl_Loopl -library produced 
according to the execution example 2 consisting of 150,000 clones on one plate. The RNase 
Tl-activity was detected as described above via cross correlation analysis. For a better 
overview a reciprocal view was chosen, i.e. that high peaks mean a low signal and low peaks 
a high signal. Fig. 2 shows 1 clear peak, which is caused by a loss of the cross correlation 
signal. This peak indicates that in the experiment an RNase Tl-activity, which now is able to 
cut a substrate after A, was present in one of the 96 wells. 

4. Re-isol ation of the partial library 

In the plate obtained according to execution example 2 (section 1. - 3.10.) a plasmid 
preparation is conducted with the stored bacterial pellet from the protein preparation of the 
well, in which the activity determination (3.10.) has shown an RNase Tl-activity after 
adenosine, using the QIAprep Mini-preparation-kit (Qiagen, Hilden, Germany). 

Through the original division of 150,000 clones on the plate a number of 150,000 / 96 = 1563 
different clones per well resulted. 
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5.1 Further sep a rations - 1 _ step 

Through a transformation of different aliquots of the thus obtained partial library analogous to 
the execution example 1 (section 1.6) the amount of plasmid DNA was determined, which is 
necessary, to now obtain 5,000 transformed clones via electroporation. 

Afterwards the determined amount of the partial library was transformed into the expression 
strain and the same process as for the original library is conducted. 

As in the original well 1563 different clones were present und about 5000 clones were divided 
up, it should be possible to find the adenosine-cleaving activity showing clone about 3 times. 

Fig. 3 shows the obtained data fort his partial library. One well was detected with a very high 
activity and three additional were detected with an activity which can be clearly distinct from 
the background, so that the clone was present 4 times in the plate. The well with the highest 
activity value was chosen for the additional singling step. In this well no more than 5000 / 96 
= 52 different clones were present. The plasmids in turn were re-isolated from the bacterial 
pellet in this well. 

5.2 Add itional separations - 2. step 

An additional repetition of the depicted scheme with a division of now about 500 clones lead 
to an additional enriched partial library of in average 250 / 96 = 5.2 clones per well. The 
activity producing clone could be re-found on this plate 10 times (Fig. 4). From one of the 
activity showing wells again the plasmids were isolated from the bacterial pellet. 

5.3 Addit ional sep ar ations - 3. step 

An aliquot of the plasmid mixture was electroporated into the expression strain and the 
transformants were plated on an agar plate and the plate was incubated at 37°C overnight. 
From the grown single colonies 20 were selected and therewith 100 ul of preparatory culture 
were directly put forth on a MTP like in 3.5. After conducting the steps 3.6 - 3.10 the detected 
activity could be allocated to a single clone and the genotype of the adenosine-cleaving 
RNaseTl -variant could be identified. 
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List of abbreviations: 

In the description of the invention the following abbreviations are used: 

B. subtilis Bacillus subtilis 

C. lucknowese Chrysosporium lucknowese 

Cy5 Fluorescence dye Cy5™ (Amersham Biosciences UK Limited, Little 

Chalfont, Buckinghamshire, GB) 

DEPC Diethyl pyrocarbonate 

DW P Deep well plate 

E - coli Eschericha coli 

EDTA Ethylene diamine tetra acetic acid 

h hour 

WTG Isopropyl-|3-D-thiogalacto-pyranoside 

LB Luria Broth 

MES Morpholinoethane sulfonic acid 

m * n minutes 

MTP microtiter plate 

0° optical density 

OD 6oo optical density at 600 nm 

om P A °uter membrane protein A from E. coli 

P plasmid 

PCR polymerase-chain-reaction 

p T7 T7-promotor 

rA Riboadenylic acid residue 
Riboguanylic acid residue 

T P m rounds per minute 

RhG Rhodamine Green (Fluorescence dye) 

SAP Alkaline phosphatase from shrimp 

S. cerevisiae Saccharomyces cerevisiae (yeast) 

Tns T ris-(hydroxymethyl)-aminomethane 

T4 coming from bacteriophage T4 

U Unit (for enzyme activity) 

w/v weight per volume 



Patent claims 



1- Method for the identification of biomolecules in variant libraries of 
biomolecules comprising the steps: 

a) Production of a variant library, consisting of a number of variants (B 0 ) 
of gene sequences coding for the biomolecule, and 

b) Division of the variant library into a number of compartments (W 0 ), 
which is at least by a factor often smaller than the number of variants 
in the variant library (B 0 ), 

Whereas each compartment contains a partial library which contains 
Ko=B{/W 0 variants, 

c) Production of biomolecules in the compartments and testing of the 
biomolecules obtained in the single compartments for a specified 
property (phenotype), whereas from the observed phenotype no direct 
conclusions on the genotype can be made, 

d) Selection of at least one compartment, which contains biomolecules 
fulfilling the wanted properties, 

e) Division of the partial library contained in the selected compartment 
into further compartments and 

f) n-fold repetition of the steps c) to e) until in every compartment 
maximally only one variant (K„<=1) of the gene sequence coding for 
the biomolecule is contained. 



The method of claim 1, wherein the wanted property is a biocatalytic activity. 

The method of claim 1 or 2, wherein in step c) also an amplification of the 
partial library takes place in the compartments up to an number of individuals 
V 0 (x) at the point in time x per compartment, whereas the number of 
individuals V 0 (x) divided, by the number of clones per compartment Ko gives 
the amplification factor F 0 (x) per clone. 



The method of one of the claims 1 to 3, wherein in step e) the division is 
carried out under dilution of the partial library by means of factor F 0 (x), so that 
in a given volume every clone contained in the compartment is statistically 
present up to a number Xo < W,, this volume is divided up in a number of new 
compartments W,, whereas the new number of clones per compartment 
amounts to K, = Xo * Ko / W, . 

The method of one of the claims 1 to 4, wherein the variant library contains 10 3 
to 10 15 variants of the gene sequence of the biomolecule. 

The method of one of the claims 1 to 5, wherein in step b) the variant library is 
divided up in 10 1 to 10 4 compartments. 

The method of one of the claims 1 to 6, wherein in step b) the variant library is 
transferred into an organism before division. 

The method of claim 7, wherein in step c) the culture of the organism after 
division is amplified to a number of organisms of 10 8 to 10 9 per compartment. 

The method of claim 7 or 8, wherein the organisms also conduct the production 
of the biomolecules. 



The method of one of the claims 7 to 9, wherein the partial libraries in the 
compartments are re-isolated from the organisms und the production of the 
biomolecules is conducted by cell-free systems. 

The method of one of the claims 1 to 6, wherein the amplification of the partial 
libraries and the production of the biomolecules is conducted by cell-free 
systems. 



method of one of the claims 1 to 1 1, wherein the variant library consists of 
\-plasmids, which contain the gene sequence coding for the biomolecule. 



The method of one of the claims 1 to 1 1, wherein the variant library consists of 
linear nucleic acid molecules, which contain the gene sequence coding for the 
biomolecule. 

The method of one of the claims 1 to 13, wherein the biomolecules are 
enzymes or ribozymes or other biomolecules, which exhibit a biocatalytic 
activity. 

The method of one of the claims 1 to 14, wherein the test for a biocatalytic 
activity is conducted with physical detection methods, like preferentially the 
UV/VIS-spectroscopy, the fluorescence spectroscopy or the fluorescence- 
correlation-spectroscopy. 



IAP15 Rec'd PCT/PTO 2 0 APR 2006 

SEQUENCE LISTING 

<110> c-LEcta GmbH 

libraries th ° d ^ SeleCtion of ^molecules from biomolecule variant 

<130> 401P03DPCT 

<150> DE10350474.5 
<151> 2003-10-23 

<160> 11 

<170> Patentln version 3.3 

<210> 1 

<211> 28 

<212> DNA 

<213> artificial 

<400> 1 

caattctgca gttgcgttca cgtcgttg 
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<210> 2 

<211> 28 

<212> DNA 

<213> artificial 

<400> 2 

taaggctcat gaaaaacaca gctatcgc 



<210> 3 

<211> 378 

<212> DNA 

<213> Escherichia coli 
<220> 

<221> ompA-signal peptide 

<222> (1) . . (63) 

<223> 

<220> 

<221> RNase Tl wildtype 

<222> (64) . . (378) 
<223> 

<400> 3 



28 



atgaaaaaca cagctatcgc gattgcagtg gcactggctg gtttcgctac cgtagcgcag 60 
gccgcatgcg actacacttg cggttctaac tgctactctt cttcagacgt ttctactgct 120 
caggcggccg gatataaact tcacgaagac ggtgaaactg ttggatccaa ttcttaccca 180 
cacaagtaca acaactacga aggttttgat ttctctgtga gctctcccta ctacgaatgg 
cctatcctct cgagcggtga tgtttactct ggtgggtccc cgggtgctga ccgtgtcgtc 
ttcaacgaaa acaaccaact agctggtgtt atcactcaca ctggtgcttc tggtaacaac 
ttcgttgaat gtacataa 



240 
300 
360 
378 



1 



<210> 4 
<211> 378 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> ompA- signal peptide 

<222> (1) . . (63) 

<223> 

<220> 

<221> RNaseTl -His 92 Ala 
<222> (64) . . (378) 
<223> 
<400> 4 

atgaaaaaca cagctatcgc gattgcagtg gcactggctg gtttcgctac cgtagcgcag 60 
gccgcatgcg actacacttg cggttctaac tgctactctt cttcagacgt ttctactgct 12 0 
caggcggccg gatataaact tcacgaagac ggtgaaactg ttggatccaa ttcttaccca 180 
cacaagtaca acaactacga aggttttgat ttctctgtga gctctcccta ctacgaatgg 24 0 
cctatcctct cgagcggtga tgtttactct ggtgggtccc cgggtgctga ccgtgtcgtc 300 
ttcaacgaaa acaaccaact agctggtgtt atcactgcca ctggtgcttc tggtaacaac 360 
ttcgttgaat gtacataa 378 



<210> 


5 


<211> 


7336 


<212> 


DNA 


<213> 


Plasmid pA2Tl 


<220> 




<221> 


lac promotor 


<222> 


(1) . . (371) <223> 


<220> 




<221> 


ompA- signal peptide 


<222> 


(393) . . (455) 


<223> 




<220> 




<221> 


RNaseTl - wi Idtype 


<222> 


(456) . . (770) 


<223> 




<220> 




<221> 


lad-Gene 


<222> 


(1664) . . (2887) 


<223> 




<220> 




<221> 


OR I 


<222> 


(4924) . . (5115) 


<223> 




<220> 




<221> 


Beta- lactamase (Amp) 


<222> 


(7165 . . (6302) ) 


<223> 




<400> 
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taggcgtatc acgaggccct ttggataacc agaagcaata aaaaatcaaa tcggatttca 60 
ctatataatc tcactttatc taagatgaat ccgatggaag catcctgttt tctctcaatt 120 
tttttatcta aaacccagcg ttcgatgctt ctttgagcga acgatcaaaa ataagtgcct 180 
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tcccatcaaa aaaatattct caacataaaa aactttgtgt aatacttgta acgctacatg 240 

gagattaact caatctagct agagaggctt tacactttat gcttccggct cgtataatgt 300 

gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacggat 3 60 

tcactggaac tctagataac gaggcgcaaa aaatgaaaaa cacagctatc gcgattgcag 420 

tggcactggc tggtttcgct accgtagcgc aggccgcatg cgactacact tgtggttcca 480 

actgctactc ttcttcagac gtttctactg ctcaagcggc cggatataaa cttcacgaag 540 

acggtgaaac tgttggatcc aattcttacc cacacaaata caacaactac gaaggttttg 600 

atttctctgt gagctctccc tactacgaat ggcctatcct ctcgagcggt gatgtttact 660 

ctggtgggtc cccgggtgct gaccgtgtcg tcttcaacga aaacaaccaa ctagctggtg 720 

ttatcactca cactggtgct tctggtaaca acttcgttga atgtacataa gcttggatcg 780 

atccgggctg agcaacgacg tgaacgcaat gcgttccgac gttcaggctg ctaaagatga 840 

cgcagctcgt gctaaccagc gtctggacaa catggctact aaataccgca agfcaatagta 900 

cctgtgaagt gaaaaatggc gcacattgtg cgacattttt tttgtctgcc gtttaccgct 960 

actgcgtcac gcgtaacata ttcccttgct ctggttcacc attctgcgct gactctactg 1020 

aaggcgcatt gctggctgcg ggagttgctc cactgctcac cgaaaccgga taccctgccc 1080 

gacgatacaa cgctttatcg actaacttct gatctacagc cttattgtct ttaaattgcg 1140 

taaagcctgc tggcagtgtg tatggcattg tctgaacgtt ctgctgttct cctgccgata 1200 

gtggtcgatg tacttcaaca taacgcatcc cgttaggctc cacggaatat ttcaccggtt 1260 

cgttgatcac tttcaccggc gttcccgtcc gcacgctgga gaacaaggct ttaatatccg 1320 

gtgcattcat gcgaatacac cctgaactga cgcgcaaacc gacgctgtcc ggcgcactgg 13 80 

taccatgaat gaggtattcg ccattaccat gcgcgaggcg cagtgcgtaa cgtcctagcg 1440 

ggttatttgg tccggcagga acgactggcg gtaatttaat gccacgctcc agcgaacgct 1500 

gacgaatgcc tgccgtaggc gtccaggttg ggttagggat tttctgccca acacgcgttt 1560 

ccatcaccgg cgtttccagc ccctgcaatc caatacctat tggataaacc tgcacaatat 1620 

tttctcccgg cggataataa taaaggcgca gctctgcaag gttgacacca tcgaatggcg 1680 

caaaaccttt cgcggtatgg catgatagcg cccggaagag agtcaattca gggtggtgaa 174 0 

tgtgaaacca gtaacgttat acgatgtcgc agagtatgcc ggtgtctctt atcagaccgt 1800 

ttcccgcgtg gtgaaccagg ccagccacgt ttctgcgaaa acgcgggaaa aagtggaagc 1860 

ggcgatggcg gagctgaatt acattcccaa ccgcgtggca caacaactgg cgggcaaaca 1920 

gtcgttgctg attggcgttg ccacctccag tctggccctg cacgcgccgt cgcaaattgt 1980 
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cgcggcgatt aaatctcgcg ccgatcaact gggtgccagc gtggtggtgt cgatggtaga 2040 
acgaagcggc gtcgaagcct gtaaagcggc ggtgcacaat cttctcgcgc aacgcgtcag 2100 
tgggctgatc attaactatc cgctggatga ccaggatgcc attgctgtgg aagctgcctg 2160 
cactaatgtt ccggcgttat ttcttgatgt ctctgaccag acacccatca acagtattat 2220 
tttctcccat gaagacggta cgcgactggg cgtggagcat ctggtcgcat tgggtcacca 2280 
gcaaatcgcg ctgttagcgg gcccattaag ttctgtctcg gcgcgtctgc gtctggctgg 2340 
ctggcataaa tatctcactc gcaatcaaat tcagccgata gcggaacggg aaggcgactg 24 00 

gagtgccatg tccggttttc aacaaaccat gcaaatgctg aatgagggca tcgttcccac 2460 

tgcgatgctg gttgccaacg atcagatggc gctgggcgca atgcgcgcca ttaccgagtc 2520 

cgggctgcgc gttggtgcgg atatctcggt agtgggatac gacgataccg aagacagctc 2580 

atgttatatc ccgccgtcaa ccaccatcaa acaggatttt cgcctgctgg ggcaaaccag 2640 

cgtggaccgc ttgctgcaac tctctcaggg ccaggcggtg aagggoaatc agctgttgcc 2700 

cgtctcactg gtgaaaagaa aaaccaccct ggcgcccaat acgcaaaccg cctctccccg 2760 

cgcgttggcc gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca 2820 

gtgagcgcaa cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact 2880 

ttatgctaac gataatcccc tgacgcggtg catcaggtaa taacagttgt gaaggaatag 2940 

ttatcgtcgt accaggtttt ggcaccgggg cgatagtgtt attggcttca aggatcaaca 3000 

ttgccgcagt atcaaaacgt cgggcaatag cctgaaggtt tttatcccct tcttgcaccg 3060 

tatacgtttg attttgccca accagtcggc ttccggttgg tggtagcgga taatcaaccg 3120 

cccaggcagc ctggatggcg ctaaaagcgc cgataagcgt gagtgtaagc aaagacgcgc 3180 

gtttcattgt aaacctcctg tatttgccgg agactcacgc tgaaacgtcg gatggcgctt 3240 

atgttcacct gaaaccaaaa cactcctgtg caggtcagtg taaacattga ccatccggca 3300 

atgtgagcca accggatgaa agctgtcctt ttagtttagc taagtgcagc ggctttggcg 3360 

cgaattgcgc gaatcatcgc ttccagacct tgtgaacgag atggggtgag atgttgggtg 3420 

agcgccattt tttcaaacca cggacgcaca tcgaaattga caatatcctg cggcgtcatc 3480 

tgatcgtaga gaataaagac gaccgcaata agccctttca caatcgccgc atcgctgtcg 3540 

ccctgtaatt caataattcc ctgggcattc tggcgcatga caatccacac ctgactctga 3600 

cagccctgaa tgctattttg tggacttctg tcttcgtcgc gtaattctgg cagacgctgg 3660 

gggaccgatg cccttgagag ccttcaaccc agtcagctcc ttccggtggg cgcggggcat 3720 

gactatcgtc gccgcactta tgactgtctt ctttatcatg caactcgtag gacaggtgcc 3780 

ggcagcgctc tgggtcattt tcggcgagga ccgctttcgc tggagcgcga cgatgatcgg 3840 
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cctgtcgctt gcggtattcg gaatcttgca cgccctcgct caagccttcg tcactggtcc 3 900 
cgccaccaaa cgtttcggcg agaagcaggc cattatcgcc ggcatggcgg ccgacgcgct 3 960 

gggctacgtc ttgctggcgt tcgcgacgcg aggctggatg gccttcccca ttatgattct 4 020 

tctcgcttcc ggcggcatcg ggatgcccgc gttgcaggcc atgctgtcca ggcaggtaga 4080 

tgacgaccat cagggacagc ttcaaggatc gctcgcggct cttaccagcc taacttcgat 414 0 

cactggaccg ctgatcgtca cggcgattta tgccgcctcg gcgagcacat ggaacgggtt 4200 

ggcatggatt gtaggcgccg ccctatacct tgtctgcctc cccgcgttgc gtcgcggtgc 4260 

atggagccgg gccacctcga cctgaatgga agccggcggc acctcgctaa cggattcacc 4320 

actccaagaa ttggagccaa tcaattcttg cggagaactg tgaatgcgca aaccaaccct 4 380 

tggcagaaca tatccatcgc gtccgccatc tccagcagcc gcacgcggcg catctcgggc 4440 

agcgttgggt cctggccacg ggtgcgcatg atcgtgctcc tgtcgttgag gacccggcta 4500 

ggctggcggg gttgccttac tggttagcag aatgaatcac cgatacgcga gcgaacgtga 4560 

agcgactgct gctgcaaaac gtctgcgacc tgagcaacaa catgaatggt cttcggtttc 4620 

cgtgtttcgt aaagtctgga aacgcggaag tcagcgccct gcaccattat gttccggatc 4680 

tgcatcgcag gatgctgctg gctaccctgt ggaacaccta catctgtatt aacgaagcgc 4 74 0 

tggcattgac cctgagtgat ttttctctgg tcccgccgca tccataccgc cagttgttta 4 800 

ccctcacaac gttccagtaa ccgggcatgt tcatcatcag taacccgtat cgtgagcatc 4860 

ctctctcgtt tcatcggtat cattaccccc atgaacagaa atccccctta cacggaggca 4920 

tcagtgacca aacaggaaaa aaccgccctt aacatggccc gctttatcag aagccagaca 4 980 

ttaacgcttc tggagaaact caacgagctg gacgcggatg aacaggcaga catctgtgaa 5040 

tcgcttcacg accacgctga tgagctttac cgcagctgcc tcgcgcgttt cggtgatgac 5100 

ggtgaaaacc tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat 5160 

gccgggagca gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggcgca 5220 

gccatgaccc agtcacgtag cgatagcgga gtgtatactg gcttaactat gcggcatcag 5280 

agcagattgt actgagagtg caccatatgc ggtgtgaaat accgcacaga tgcgtaagga 5340 

gaaaataccg catcaggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 5400 

ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 5460 

caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 5520 

aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 5580 

atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 564 0 
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cccctggaag ctccctcgtg cgctctcctg ttccgaccct 
ccgcctttct cccttcggga agcgtggcgc tttctcatag 
gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca 
accgctgcgc cttatccggt aactatcgtc ttgagtccaa 
cgccactggc agcagccact ggtaacagga ttagcagagc 
cagagttctt gaagtggtgg cctaactacg gctacactag 
gcgctctgct gaagccagtt accttcggaa aaagagttgg 
aaaccaccgc tggtagcggt ggtttttttg tttgcaagca 
aaggatctca agaagatcct ttgatctttt ctacggggtc 
actcacgtta agggattttg gtcatgagat tatcaaaaag 
taaattaaaa atgaagtttt aaatcaatct aaagtatata 
gttaccaatg cttaatcagt gaggcaccta tctcagcgat 
tagttgcctg actccccgtc gtgtagataa ctacgatacg 
ccagtgctgc aatgataccg cgagacccac gctcaccggc 
accagccagc cggaagggcc gagcgcagaa gtggtcctgc 
agtctattaa ttgttgccgg gaagctagag taagtagttc 
acgttgttgc cattgctgca ggcatcgtgg tgtcacgctc 
tcagctccgg ttcccaacga tcaaggcgag ttacatgatc 
cggttagctc cttcggtcct ccgatcgttg tcagaagtaa 
tcatggttat ggcagcactg cataattctc ttactgtcat 
ctgtgactgg tgagtactca accaagtcat tctgagaata 
gctcttgccc ggcgtcaaca cgggataata ccgcgccaca 
tcatcattgg aaaacgttct tcggggcgaa aactctcaag 
ccagttcgat gtaacccact cgtgcaccca actgatcttc 
gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc 
cacggaaatg ttgaatactc atactcttcc tttttcaata 
gttattgtct catgagcgga tacatatttg aatgtattta 
ttccgcgcac atttccccga aaagtgccac ctgacgtcta 
cattaaccta taaaaa 



gccgcttacc 
ctcacgctgt 
cgaacccccc 
cccggtaaga 
gaggtatgta 
aaggacagta 
tagctcttga 
gcagattacg 
tgacgctcag 
gatcttcacc 
tgagtaaact 
ctgtctattt 
ggagggctta 
tccagattta 
aactttatcc 
gccagttaat 
gtcgtttggt 
ccccatgttg 
gttggccgca 
gccatccgta 
gtgtatgcgg 
tagcagaact 
gatcttaccg 
agcatctttt 
aaaaaaggga 
ttattgaagc 
gaaaaataaa 
agaaaccatt 



ggatacctgt 
aggtatctca 
gttcagcccg 
cacgacttat 
ggcggtgcta 
tttggtatct 
tccggcaaac 
cgcagaaaaa 
tggaacgaaa 
tagatccttt 
tggtctgaca 
cgttcatcca 
ccatctggcc 
tcagcaataa 
gcctccatcc 
agtttgcgca 
atggcttcat 
tgcaaaaaag 
gtgttatcac 
agatgctttt 
cgaccgagtt 
ttaaaagtgc 
ctgttgagat 
actttcacca 
ataagggcga 
atttatcagg 
caaatagggg 
attatcatga 



5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7336 
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<210> 6 
<211> 3653 
<212> DNA 

<213> Plasmid pETBlue-2 
<220> 

<2 2 1 > T7 -promotor 
<222> (1) . . (17) 
<223> 
<220> 

<221> lac operator 
<222> (22) . . (42) 
<223> 
<220> 

<221> fl ORI 

<222> (1096) . . (1551) 

<223> 

<220> 

<221> Beta-lactamase (Amp) 
<222> (2556) . . (1669) 
<223> 
<220> 

<221> pUC ORI 

<222> (3206) .. (3250) 

<223> 

<220> 

<221> lac operator 
<222> (3606) . . (3625) 
<223> 
<400> 6 

taatacgact cactataggg gaattgtgag cggataacaa ttcccctcta gacttacaat 60 
ttccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtacg ggcctcttcg 120 
ctattacgcc agcttgcgaa cggtgggtgc gctgcaaggc gattaagttg ggtaacgcca 180 
ggattctccc agtcacgacg ttgtaaaacg acggccagcg agagatcttg attggctagc 240 
agaataattt tgtttaactt taagaaggag atataccatg gcgatatccc gggagctcgt 300 
ggatccgaat tctgtacagg cgcgcctgca ggacgtcgac ggtaccatcg atacgcgttc 360 
gaagcttgcg gccgcacagc tgtatacacg tgcaagccag ccagaactcg ctcctgaaga 420 
cccagaggat ctcgagcacc accaccacca ccactaatgt taattaagtt gggcgttgta 480 
atcatagtca taatcaatac tcctgactgc gttagcaatt taactgtgat aaactaccgc 540 
attaaagcta ttcgatgata agctgtcaaa catgataatt cttgaagacg aaagggccta 600 
ggctgataaa acagaatttg cctggcggca gtagcgcggt ggtcccacct gaccccatgc 660 
cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc catgcgagag 720 
tagggaactg ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgt 780 
tttatctgtt gtttgtcggt gaacgctctc ctgagtagga caaatccgcc gggagcggat 840 
ttgaacgttg cgaagcaacg gcccggaggg tggcgggcag gacgcccgcc ataaactgcc 900 
aggcatcaaa ttaagcagaa ggccatcctg acggatggcc tttttgcgtt tctacaaact 960 
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cttttgttta 
ccccttgggg 
ggattggcga 
cgcgcagcgt 
cttcctttct 
ta 99gttccg 
gttcacgtag 
cgttctttaa 
attcttttga 
tttaacaaaa 
cgatggcatg 
ttttaaatca 
cagtgaggca 
cgtcgtgtag 
accgcgagac 
ggccgagcgc 
ccgggaagct 
tacaggcatc 
acgatcaagg 
tcctccgatc 
actgcataat 
ctcaaccaag 
aatacgggat 
ttcttcgggg 
cactcgtgca 
aaaaacagga 
actcatactc 
tgagcgtcag 
gtaatctgct 
caagagctac 
actgtccttc 



tttttctaaa 
cctctaaacg 
atgggacgcg 
gaccgctaca 
cgccacgttc 
atttagtgct 
tgggccatcg 
tagtggactc 
tttataaggg 
atttaacgcg 
agattatcaa 
atctaaagta 
cctatctcag 
ataactacga 
ccacgctcac 
agaagtggtc 
agagtaagta 
gtggtgtcac 
cgagttacat 
gttgtcagaa 
tctcttactg 
tcattctgag 
aataccgcgc 
cgaaaactct 
cccaactgat 
aggcaaaatg 
ttcctttttc 
accccgtaga 
gcttgcaaac 
caactctttt 
tagtgtagcc 



tacattcaaa 
ggtcttgagg 
ccctgtagcg 
cttgccagcg 
gccggctttc 
ttacggcacc 
ccctgataga 
ttgttccaaa 
attttgccga 
aattttaaca 
aaaggatctt 
tatatgagta 
cgatctgtct 
tacgggaggg 
cggctccaga 
ctgcaacttt 
gttcgccagt 
gctcgtcgtt 
gatcccccat 
gtaagttggc 
tcatgccatc 
aatagtgtat 
cacatagcag 
caaggatctt 
cttcagcatc 
ccgcaaaaaa 
aatcatgacc 
aaagatcaaa 
aaaaaaacca 
tccgaaggta 
gtagttaggc 



tatgtatccg 
ggttttttgc 
gcgcattaag 
ccctagcgcc 
cccgtcaagc 
tcgaccccaa 
cggtttttcg 
ctggaacaac 
tttcggccta 
aaatattaac 
cacctagatc 
aacttggtct 
atttcgttca 
cttaccatct 
tttatcagca 
atccgcctcc 
taatagtttg 
tggtatggct 
gttgtgcaaa 
cgcagtgtta 
cgtaagatgc 
gcggcgaccg 
aactttaaaa 
accgctgttg 
ttttactttc 
gggaataagg 
aaaatccctt 
ggatcttctt 
ccgctaccag 
actggcttca 
caccacttca 



ctgagcaata 
tgaaaggagg 
cgcggcgggt 
cgctcctttc 
tctaaatcgg 
aaaacttgat 
ccctttgacg 
actcaaccct 
ttggttaaaa 
gtttacaatt 
cttttaaatt 
gacagttacc 
tccatagttg 
ggccccagtg 
ataaaccagc 
atccagtcta 
cgcaacgttg 
tcattcagct 
aaagcggtta 
tcactcatgg 
ttttctgtga 
agttgctctt 
gtgctcatca 
agatccagtt 
accagcgttt 
gcgacacgga 
aacgtgagtt 
gagatccttt 
cggtggtttg 
gcagagcgca 
agaactctgt 



actagcataa 
aactatatcc 
gtggtggtta 
gctttcttcc 
gggctccctt 

ta gggtgatg 

ttggagtcca 
atctcggtct 
aatgagctga 
tctggcggca 
aaaaatgaag 
aatgcttaat 
cctgactccc 
ctgcaatgat 
cagccggaag 
ttaattgttg 
ttgccattgc 
ccggttccca 
gctccttcgg 
ttatggcagc 
ctggtgagta 
gcccggcgtc 
ttggaaaacg 
cgatgtaacc 
ctgggtgagc 
aatgttgaat 
ttcgttccac 
ttttctgcgc 
tttgccggat 
gataccaaat 
agcaccgcct 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 
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2880 



acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 

cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 2940 

gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 3000 

cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 3 060 

gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg 3120 

tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 3180 

tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 3240 

gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 3300 

aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 3360 

agcgagtcag tgagcgagga agccggcgat aatggcctgc ttctcgccga aacgtttggt 3420 

ggcgggacca gtgacgaagg cttgagcgag ggcgtgcaag attccgaata ccgcaagcga 3480 

caggccgatc atcgtcgcgc tccagcgaaa gcggtcctcg ccgaaaatga cccagagcgc 3540 

tgccggcacc tgtcctacga gttgcatgat aaagaagaca gtcataagtg cggcgacgac 3600 
cggtgaattg tgagcgctca caattctcgt gacatcataa cgtcccgcga aat 



<210> 7 

<211> 18 

<212> DNA 

<213> artificial 

<400> 7 

gtggctggct ggtatgga 



<210> 8 

<211> ,18 

<212> DNA 

<213> artificial 

<400> 8 

tatctgtgct tcggtggc 



<210> 9 

<211> 98 

<212> DNA 

<213> artificial 

<220> 

<223> Primer Loopl_32 
<220> 



3653 



18 



18 



<400> ^° mpOSitions of n and b *™ ^rther specified in the description 
gtaggatcca attcttaccc acacnnbrmb nnbnnbnnbn nbnnbnnbnn bnnbnnbnnb 60 
nnbnnbnnbn nbnnbgaatg gcctatcctc tcgagcgg 9Q 
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<210> 10 
<211> 

<212> DNA 

<213> artificial 

<220> 

<223> substrate u Sub_G w 
<220> 

<223> the g at position 24 is a ribonucleotide 

<4 00> 10 

ccataccagc cagccacaag caagccaccg aagcacagat a 



<210> 11 
<211> 

<212> DNA 

<213> artificial 

<220> 

<22 3> substrate M Sub A w 
<22 0> 

<223> the a at position 24 is a ribonucleotide 

<4 00> 11 

ccataccagc cagccacaag caaaccaccg aagcacagat a 



Fig.l 



Fig. 2 



Fig. 3 
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Title: Method for the selection of biomolecules from biomolecule variant libraries 

Abstract: The invention relates to a method from the selection of biomolecules from variant 
libraries, in particular of biocatalytically active biomolecules, comprising the steps: a) 
production of a variant library, b) division of the library into a number of compartments, 
which is smaller than the total number of variants in the variant library by a factor of at least' 
10, c) production and testing of the biomolecules in the individual compartments for a 
particular property, for example, a biocatalytic activity, d) selection of at least one 
compartment I in which there are biomolecules fulfilling the desired property, e) division of 
the partial library contained in the selected compartment into further compartments and f) n- 
fold repetition of the steps c) to e) until each compartment contains only one variant of the 
gene sequence coding for the biomolecule. In contrast to established methods which comprise 
mutagenesis and selection steps, said method starts with a large library in which the desired 
variant is contained from the outset. 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE (S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



